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ABSTRACT 

A generic prediction of hierarchical clustering models is that the mass function of 
dark haloes in dense regions in the Universe should be top-heavy. We provide a novel 
test of this prediction using a sample of galaxies drawn from the Sloan Digital Sky 
Survey. To perform the test, we compare measurements of galaxy clustering in dense 
and underdense regions. We find that galaxies in dense regions cluster significantly 
more strongly than those in less dense regions. This is true over the entire 0.1-30 Mpc 
pair separation range for which we can make accurate measurements. We make similar 
measurements in realistic mock catalogs in which the only environmental effects are 
those which arise from the predicted correlation between halo mass and environment. 
We also provide an analytic halo-model based calculation of the effect. Both the mock 
catalogs and the analytic calculation provide rather good descriptions of the SDSS 
measurements. Thus, our results provide strong support for hierarchical models. They 
suggest that, unless care is taken to study galaxies at fixed mass, correlations between 
galaxy properties and the surrounding environment are almost entirely due to more 
fundamental correlations between galaxy properties and host halo mass, and between 
halo mass and environment. 

Key words: methods: analytical - galaxies: formation - galaxies: haloes - dark matter 
- large scale structure of the universe 



1 INTRODUCTION 



The correlation between galaxy properties (morphology, star 
formation rates, luminosity, color etc.) and the surrounding 
environment has been the subject of extensive studies in 
the last few decades: dense environments are preferentially 
occupied by elliptical, red, luminous galaxies, whereas star 
formation rates are higher in less dense regions (Dressier 
1980; Butcher & Oemler 1984; Norberg et al. 2001, 2002; 
Balogh et al. 2002; Gomez et al. 2003; Hogg et al. 2004; 
Kauffmann et al. 2004; Berlind et al. 2005; Croton et al. 
2005). In hierarchical models, this behaviour is expected to 
be a consequence of the fact that galaxies are surrounded by 
dark matter halos, and the properties of halos (mass, forma- 
tion time, concentration, internal angular momentum, etc.) 
are correlated with their environments (Mo & White 1996; 
Sheth & Tormen 1999, 2002, 2004; Lemson & Kauffmann 
1999; Gottloeber et al. 2001; Avila-Reese et al. 2005; Gao et 
al. 2005; Harker et al. 2006; Wechsler et al. 2006). 
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(UA); 



Recently, we described how the clustering of galaxies 
can be used to test the assumption that the correlations 
between galaxy properties and their environments are en- 
tirely a consequence of the correlations between haloes and 
their environments (Abbas & Sheth 2005). This is a strong 
assumption which significantly simplifies interpretation of 
the observed luminosity dependence of galaxy clustering 
(e.g. Zehavi et al. 2005). It is also a standard assumption 
in current halo-model descriptions of galaxy clustering (see 
Cooray & Sheth 2002 for a review). The main goal of this 
paper is to perform this test. 

This paper is arranged as follows: in Section 2 we show 
how galaxy clustering depends on environment in the SDSS 
(Adelman-McCarthy et al. 2006). In particular, we measure 
the pair correlation function in redshift space, £(s|<5 3 ), for 
a range of environments 5 S , as well as the projected quan- 
tity, w p (r p \8s); the latter is free of redshift-space distortions. 
These measurements are compared with similar measure- 
ments in carefully constructed mock catalogs, and from an 
analytic calculation based on the halo-model. In both the 
mocks and the analytic calculation, correlations between 
galaxy properties and environment are entirely a conse- 
quence of the correlation between galaxy properties and halo 
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masses, and between halo mass and environment. We sum- 
marize our results in Section 3, where we also discuss some 
implications. An Appendix provides details of the analyti- 
cal model, which generalizes our earlier (real-space) work so 
that it can be used to model redshift space measurements 
as well. 



2 THE ENVIRONMENTAL DEPENDENCE OF 
CLUSTERING 

To measure the environmental dependence of clustering, we 
must decide on a measure of the environment. Abbas & 
Sheth (2005) showed that the precise choice of environment 
is not particularly important, in the sense that different 
choices lead to quantitative but not qualitative differences. 
They used Nr, the number of galaxies in a sphere of radius 
R centred on a galaxy, as a measure of that galaxy's environ- 
ment, and presented results for R = 5 and 8/i~ Mpc. Their 
analysis was performed in real-space. Performing a similar 
analysis in redshift-space is complicated because the envi- 
ronmental effect we would like to test is due to correlations 
between halo masses with the real-space density. Therefore, 
we must find a definition of density in redshift space which 
is as faithful as possible to that in real-space. 

Line-of-sight redshift-space distortions can make a 
sphere in real-space appear very different in redshift space. 
For instance, around a spherically symmetric cluster there 
are two main effects: one is due to coherent infall around 
the center of the cluster, which appears as a squashing effect 
along the line of sight in redshift space (Kaiser f987). The 
second is the "finger of God" effect which is due to the virial 
motions of galaxies within the cluster (de Lapparent et al. 
f986). This shows up as an elongation of the cluster along 
the line of sight. The squashing effect is relatively small, 
producing effects of order unity or less, whereas the finger- 
of-god distortions are more dramatic — elongations along the 
line of sight are typically about a factor of ten. Since clus- 
ters have radii of a Mpc or so, fingers of god can extend up 
to about fO Mpc. Therefore, while counts in redshift space 
spheres of radii 5/i _1 Mpc are not expected to faithfully trace 
the counts in the corresponding real-space spheres, counts 
in spheres of radii 8fe _1 Mpc, As should be more similar. 
For this reason, in what follows we use Ns as a measure of 
the environment of each galaxy. (If we wished to push to 
smaller scales, we could identify all the fingers of god, and 
then "decompress" them, by rescaling the distances along 
the line-of-sight so that they have the same extent as across 
the line-of-sight e.g. Tegmark et al. 2004. But performing 
such a "manicure" is beyond the scope of the present work.) 

We use Ns to divide the galaxy population into three 
equal-sized subsamples: the third with the largest values of 
Ns are defined as being the dense subsample, and the third 
with the smallest values of Ns are the underdense subsam- 
ple. We then measure the correlation functions in these two 
subsamples. 

Our strategy is to make such measurements in a volume 
limited galaxy catalog, so that selection effects are mini- 
mized. We then compare with similar measurements in real- 
istic mock catalogs. Throughout, we show results for a flat 
ACDM model for which (fi , h, a 8 ) = (0.3, 0.7, 0.9) at z = 0. 
Here Qo is the density in units of critical density today, the 
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Figure 1. Pic-diagrams of the SDSS subsamples: top and bot- 
tom panels show the distributions of objects classified as being 
in the least dense and the densest regions, in a slice of thickness 
100 h _1 Mpc through the survey volume. 

Hubble constant at the present time is Ho = 100ft km s _1 
Mpc -1 , and as describes the rms fluctuations of the initial 
field, evolved to the present time using linear theory, when 
smoothed with a tophat filter of radius 8/i -1 Mpc. 



2.1 The SDSS galaxy sample 

We perform our analysis on a volume limited catalog ex- 
tracted from the SDSS DR4 database (Adelman-McCarthy 
et al. 2006). We chose galaxies brighter than M r < —21, to 
match the analysis of Zehavi et al. (2005), whose results we 
use below. The resulting catalog contains about 75000 ob- 
jects with accurate angular positions and redshifts, where 
the number density is 0.00117 (ft _1 Mpc) -3 . 

As discussed above, we define the environment of each 
object using the redshift-space information. Figure 1 shows 
the spatial distribution of galaxies in a thin slice through the 
SDSS volume limited catalog. Top panel shows the objects 
classified as being in the least dense regions, and bottom 
panel are for the objects in the densest regions. The galax- 
ies in dense regions are clearly strongly clustered on small 
scales, whereas those in the underdense regions populate the 
holes defined by the spaces between the clusters that one sees 
in the dense sample. 

The following sections quantify these differences by 
measuring the correlation functions in these subsamples. 
Uncertainties on our measurements were estimated by jack- 
knife resampling, in which the statistics were remeasured 
after omitting a random region, and repeated thirty times 
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(approximately 1.5 times the total number of bins in sepa- 
ration for the results presented). 



2.2 Mock galaxy samples 

We have generated realistic mock galaxy samples as follows. 
We start with the Very Large Simulation (VLS; Yoshida, 
Sheth & Diaferio 2001), kindly made available to the public 
by the Virgo consortium. It has 512 3 particles in a cubic 
box with sides L = 479/i" 1 Mpc. About 800,000 dark mat- 
ter halos, each containing at least 10 particles, were identi- 
fied in this particle distribution using the Friends-of-Friends 
method. We use the simulation output for the mass, position 
and velocity of each dark matter halo. 

We use the results of Zehavi et al. (2005) to motivate our 
choice for how mock galaxies should be distributed within 
each halo. Specifically, to model a volume limited galaxy cat- 
alog with objects more luminous than L, halos less massive 
than rriL are assumed to contain no galaxies; mi depends on 
the galaxy population under consideration. Galaxies more 
massive than tul, contain one central galaxy, and may also 
contain satellite galaxies. The number of satellites is drawn 
from a Poisson distribution with mean (N s \m), where 
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( Ns \m) = [ — 
mi 



if m > rriL- 



(1) 



For SDSS galaxies more luminous than M r < —21, rriL = 
10 12 ' 72 h -1 M Q , mi = 23m L , and, a = 1.39 (Zehavi et al. 
2005). (A Poisson distribution for the number of satellites 
is motivated by the work of Kravtsov et al. 2004) . We then 
assume that the satellites in a halo are distributed around 
the halo center similarly to the dark matter (e.g. Navarro et 
al. 1997). 

To model redshift space effects, we must model the 
velocity vector of each mock galaxy. We do so by assum- 
ing that iig a i = «h a i + « v ir, where Whaio is the halo mo- 
tion provided by the simulation, and ivir is obtained as fol- 
lows. The central galaxy in a halo is assumed to be at rest 
with respect to the halo, so v v i T = 0. The virial motions of 
satellite galaxies are modelled by assuming that haloes are 
isotropic, virialized, and isothermal with Maxwellian veloc- 
ities around the halo center. The one-dimensional velocity 
dispersion is 1000 (r2oo VMpc)/v2, where r2oo is the scale 
on which the enclosed mass is 200 times the critical density: 
m = 200pcrit (47rr|oo/3). Following results in Sheth & Diafe- 
rio (2001), we assume that this virial term is independent of 
local environment. 

In the distant observer approximation, the position in 
redshift space is s = x + v x /Ho, where x is the real-space 
coordinate in the ^-direction, v x is the ^-component of the 
peculiar velocity, and s is the redshift-space distance in the 
^-direction. The y- and z- components of the position are 
unchanged. The isothermal Maxwellian assumption means 
that the virial motions add Gaussian noise to the line-of- 
sight position of each satellite galaxy. 

We then measure N$ for each galaxy by counting the 
total number of galaxies within 8/i. _1 Mpc. For the mock cat- 
alog, we can do this in both real- and redshift-space. Figure 2 
compares these two estimates of the local density. They are 
not widely different, suggesting that the analysis in the Ap- 
pendix will be useful. 
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Figure 2. Comparison of local density estimates within real (bot- 
tom) and redshift (top) space spheres of radius 8h — x Mpc. The 
median (given by the squares) and quartile range of halo mass 
corresponding to certain number of galaxies is shown (for clarity 
these points have been shifted upwards). 



2.3 Results 

Figure 3 quantifies the spatial differences seen in Figure 1; it 
shows the redshift space correlation functions in dense and 
underdense regions measured in the mock catalogs (upper 
panel) and in the SDSS volume limited catalog (lower panel). 
In both panels, £(s|<5 s ) for the dense sample is significantly 
larger than it is in the underdense sample. On large scales, 
this is because dense regions host the most massive haloes 
which in turn contain many galaxies; on smaller scales, the 
fact that the halo density profiles depend on halo mass also 
matters (Abbas & Sheth 2005). The inflection or break at 
the scale on which we define the environment (8/i _1 Mpc), 
which is seen in the clustering signal for underdense regions, 
arises because this scale is significantly larger than the virial 
radius of a typical halo. Let R denote the scale on which 
the environment is defined. Then, pairs which come from 
different halos are of two types: those separated by scales 
smaller than R are said to be in the same patch, whereas 
more widely separated pairs are in different patches. Abbas 
& Sheth (2005) called these the 2h - lp and 2h - 2p con- 
tributions to the statistic. Now, by definition, there are no 
2h — 2p pairs with separations smaller than 8/i _1 Mpc, so 
(,2h-2p = — 1 on smaller scales. In addition, underdense re- 
gions are those with small Nr, so they have few pairs in the 
2h— lp term by definition. In the limit in which there is only 
one halo in each underdense patch (i.e., the one surrounding 
the galaxy around which the patch was centered) , there will 
be no 2h — lp pairs. In this limit, the correlation function is 
the sum of the lh term, which falls rapidly on scales larger 
than the virial radius (a few Mpc) and the 2h — 2p term 
(which is only significant on scales larger than the patch 
radius). Therefore, in this limit, if R is significantly larger 
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Figure 3. Environmental dependence of the galaxy correlation 
function in redshift space. Upper panel shows measurements in 
the mock catalog, and lower panel shows measurements in the 
SDSS. In both cases, the galaxy catalog is volume limited to M r < 
—21, and the environment of a galaxy was denned by counting 
the number of galaxies within a redshift-space sphere of radius 
8h~ Mpc centred on it. The squares and triangles in each panel 
show £(sj<5 s ) for the galaxies in the densest 1/3 and least dense 1/3 
of the sample. Solid curves show the analytic model for £(s|5 s ) 
that is developed in Appendix A. Dashed curves in the upper 
panel show the 1-halo (dominates on small scales) and the sum 
of the 2h-lp and 2h-2p contributions (dominate on intermediate 
and large scales, respectively) to §(s|<5s) of the less-dense sample. 
In the lower panel, the dashed curves show these contributions 
for the dense sample. 

than the virial radius of a typical halo, there will be a dra- 
matic feature in £ at scale R. As the number of 2h — lp 
pairs increases, this feature becomes less obvious. Indeed, 
in dense regions — those which have larger Nr and so have 
more 2h — lp pairs, there is little evidence of this feature. 

The solid curves in the two panels show the analytic cal- 
culation outlined in the Appendix. They provide a reason- 
able description of the measurements in both panels. How- 
ever, while the agreement is good on large scales, the curves 
underestimate the small-scale signal in dense regions. Since 
these smaller scales are the ones most affected by finger-of- 
god distortions, it may be that the discrepancy is due to 
inadequacies in the analytic treatment of redshift-space ef- 
fects (see Scoccimarro 2004 for a discussion of the sorts of 
effects our analysis ignores). 

To eliminate this source of uncertainty, we have also 
studied the projected quantity 

r oo 

w p (r p \5 s )=2 dir£(r p ,Tv\8 s ); (2) 
Jo 

where r = s/7p + 7r 2 . We integrate up to it = 35 /i~ 1 Mpc, 
which is large enough to include most correlated pairs. Fig- 
ure 4 shows the results, both in the mock catalog (top panel) 
and in the SDSS (bottom panel). Now, the agreement with 



Figure 4. Similar to Figure 3, but now for the projected galaxy 
correlation function. The upper panel shows measurements in the 
mock catalog, and the lower panel is for the SDSS. Symbols and 
line-styles are the same as for Figure 3. 

the analytical model is very good, suggesting that our an- 
alytic treatment of redshift-space distortions is inadequate. 
Once again, the inflection at the scale of the patch size for 
the underdense sample is caused due to the transition from 
one type of 2-halo term to the other. 

Both for £(s|<S s ) and w p (r p \8 s ) the differences between 
the two environments are dramatic — they are measured 
with high statistical significance. Nevertheless, the analytic 
model, which only incorporates those correlations with envi- 
ronment which arise from the correlation between halo mass 
and environment, provides an excellent description of the 
measurements. This leaves little room for other environmen- 
tal effects. 



3 DISCUSSION AND CONCLUSIONS 

One of the luxuries of the latest generation of large-scale sky 
surveys is that they contain sufficiently many objects that 
one can study subsamples of galaxies divided up in various 
ways. Here, we have focused on the clustering of galaxies in 
a volume limited sample drawn from the SDSS, and studied 
how the clustering of these galaxies depends on environment. 
We find that galaxies in dense regions are considerably more 
strongly clustered than those in less dense regions (Figures 3 
and 4). 

This is perhaps not so surprising — after all, a dense re- 
gion is one in which many galaxies are crowded together. 
What is more surprising is that this dependence on envi- 
ronment is very well reproduced by numerical (Section 2.2) 
and analytic (Appendix A) models in which the entire effect 
is due to the fact that galaxy properties correlate with the 
masses of their parent halos, and massive halos preferentially 
populate dense regions. Hierarchical models make quantita- 
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tive predictions for this correlation between halo mass and 
environment, and so the agreement between our models and 
the measurements provides strong support for such models. 
In this respect, our results are consistent with those of Mo 
et al. (2004), Kauffmann et al. (2004), Berlind et al. (2005), 
Blanton et al. (2006) and Skibba et al. (2006); this is reas- 
suring, since our methods are very different. 

Our test of environmental effects is particularly inter- 
esting in view of recent work showing that, at fixed mass, 
haloes in dense regions form earlier (Sheth & Tormen 2004) , 
and that this effect is stronger for low mass haloes (Gao et 
al. 2005; Harker et al. 2006; Wechsler et al. 2006). Such a cor- 
relation is not part of our analytic model, nor is it included 
in our mock catalogs. Presumably, the good agreement with 
the SDSS measurements is due to the fact that we have con- 
centrated on luminous galaxies, and these populate the more 
massive haloes. It will be interesting to see if this agreement 
persists at lower luminosities. 

The agreement between our models and the measure- 
ments has an important consequence: Unless care is taken 
to study a population at fixed halo mass, our results indi- 
cate that observed correlations between gastrophysical ef- 
fects (e.g. ram pressure stripping, strangulation, harrass- 
ment) and environment are dominated by the fact that these 
effects also correlate with halo mass, and halo mass corre- 
lates with environment. 

Larger samples will allow us to study if these trends per- 
sist to fainter, presumably less massive galaxies. And more 
distant samples will allow us to study if these trends evolve. 
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APPENDIX A: THE ANALYTICAL MODEL 

This Appendix discusses how the halo model calculation of 
environmental effects on clustering can be extended to in- 
clude redshift-space effects. Our strategy is to combine the 
halo- model description of redshift space effects (White 2001; 
Seljak 2001) with the halo model description of environmen- 
tal effects provided by Abbas & Sheth (2005). 

In redshift space, two effects modify the real space ex- 
pressions derived by Abbas & Sheth (2005). One of these 
is a boost of power on large scales due to the instream- 
ing of matter into overdense regions (Kaiser 1987); this af- 
fects the 2-halo terms. Using density conservation to linear 
order and making the distant observer approximation, the 
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redshift-space galaxy density perturbation can be written as 



Sg" — Sg + 8 V fI 



(Al) 



where /j, = f ■ k, S g is the real space galaxy density per- 
turbation and 5 V is the velocity divergence. This is related 
to the density perturbation 8dm via 8 V — f5 am , where 
f(fl) = d log <5/d log a ~ fi ' 6 , and a is the scale factor. 

The other effect is the suppression of power due to 
the virial motions within haloes; this affects the f-halo 
term (Sheth f996). The assumption of isotropic, isothermal, 
Maxwellian motions within halos means that the effect can 
be modeled as a convolution with a Gaussian. In particular, 
the density contrast in redshift space is 



S g e 



-(kanY /2 



(A2) 



Scoccimarro (2004) discusses why these descriptions (equa- 
tions Al and A2) of redshift-space distortions are rather 
crude. For our purposes, they represent reasonable first ap- 
proximations to a more sophisticated model. 

Let n(M, V) denote the number density of patches of 
mass M and volume V , and let N(m\M, V) be the average 
number of m haloes in regions of volume V which contain 
mass M. The isotropized redshift space power spectrum is 
obtained by averaging (Sg S ) 2 over fx, m and M. In particular, 
the 1-halo term can be written as, 
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(k\S)= / dMn(M,V) / dmN(m\M,V) 

JM mi „ Jo 

[(2N s \m) u(k\m)?Ri(k(T) + (N s \m) 2 \u{k\m)\ 2 ^ 2 {ka)] 
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where 



K p (a = fca[p/2] 1/2 ) = 



y/H erf (a) 



(A3) 



(A4) 



for p = 1,2, and ns-gai is the number density of galaxies 
surrounded by regions containing at least N m in other galax- 
ies: 



(A7) 



where 

fM max 

F v = f dM n(M, V) B(M, V) 

J M min 

x / dmN(m\M,V)— »i(fca)M(fc|m) 
Jo Ps 

F g = dMn(M,V)B(M,V) 

x / dmN(m\M,V)^^^ 1 (ka)u{k\rn)- 

JO n 6 — gal 

Phin{k\Rp) denotes the power spectrum associated with set- 
ting the linear theory correlation function to —1 on scales 
smaller than the diameter of a patch 2R P . This truncation 
has little effect on small kR p <JC 1, where Puu{k\Rp) m 
Phin{k). And the factor B{M, V) describes the bias asso- 
ciated with the clustering of the patches; it depends on the 
abundance of such patches (see Abbas & Sheth 2005 for de- 
tails). 

Similarly, the 2h-lp term can be written as 

rM max , 2 1 , 

dMn(M,V)(F g 2 + -F^F'g + -F v 2 ) 



pgal 
r 1h-\p 



(k\S) = / 



X [Piin(fc) - PLin(k\Rp)\, 



(A8) 



where PLin(fc) — PLin(k\R p ) denotes the power spectrum as- 
sociated with setting the linear theory correlation function 
to zero on scales larger than the diameter of a patch 2R P , 
and 



K = f 

Jo 



dmN{m\M,V) — Ki(fca)fo(m) u(k\m) 
Ps 



?s = [ M dmN(m\M, V) 8ti (fcg)b(m) u(k\m). 

(A9) 



Here b(m) is the bias factor of haloes of mass m (from Sheth 
& Tormen 1999). The correlation function, £(s), is obtained 
by taking the Fourier transform of the power spectrum P(k). 



rM max (N max ) 

ns-gai = / dMn(M,V) 

J M min (N min ) 
rM 

X 



rM 

/ dmN(m\M,V)(N gal \m). 
Jo 



(A5) 



Here, {N ga i\rn} — l + (iVs|m) is the average number of galax- 
ies occupying a halo of mass m (in our model, it is zero below 
some m,L\ c.f. equation 1). In practice, M m in(N m in) is ob- 
tained by varying N min until the value of this expression 
matches the observed number density. 

The two-halo term is more complex as it now has two 
types of contributions: pairs which are in the same patch 
(2h-lp), and pairs in different patches (2h-2p). The 2h-lp 
term can only be important on intermediate scales (i.e., 
those which are larger than the diameter of a typical halo 
but smaller than the diameter of a patch). The 2h-2p term 



pgal 
r 2h-2 P 



(F 2 + ^F v F g + ±F 2 )P Un (k\R p ), (A6) 



